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L INTRODUCTION 


1 


visited 


Approximately 6 months ago representatives 
the Office of Central Reference (OCR) and spent some time in examination of its opera- 
tion. The purposes of this visit were first, to develop an understanding of the operation 


could be more 


of OCR with which the developmental activities 

suitably guided, and second, to provide OCR with the benefit of whatever ideas. were: stim- 
ulated by the visit. 


The following report is a discussion of the thinking which resulted from that 
visit. In no sense should it be considered an adequate system analysis of the present 
system and procedures. The examination was entirely too brief and much of the per- 
tinent data must have been missed or misevaluated. Furthermore, the report does not 
discuss specific equipment or techniques of mechanization as such. In order for any 
general recommendations concerning mechanization to have concrete realization they 
must eventually be im plemented by specific pieces of equipment. In this respect, [ 


^ direct result of discussions with OCR, is prepared to propose 

such equipment and to work with CIA in efficiently mechanizing OCR operations, if this 
18 deemed desirable; however, it was felt that equipment proposals, as such, were out- 
side the purpose of this report. 


The report consists of essentially three parts. The first is a brief summary 
of the survey results themselves. The second discusses our reaction to the present 
status of OCR. The final section is concerned with recommendations concerning 
mechanization. 


The significant conclusions can be summarized as follows: (1) In general, the 
present system of operation was considered to be reasonably efficient in terms of the 
existing conditions, i. e. , the present work load and prescribed techniques for mechan- 
ization; (2) On the other hand, the immediate reaction at the time of the survey was 
that the existing conditions themselves are completely unsatisfactory and, in particular, 
that the present work load is shockingly small and the present techniques for mechan- 
ization unsuited to the job. Further thought has merely re-enforced the substance of 
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that initial reaction, m view of this, if OCR expects to continue with the present inform- 
ation retrieval work load, it is recommended that the present system of operation be re- 
placed by a manual one which wIU be more responsive to these requirements. (4) However 
it is very strongly felt that continuation of the present work load is completely undesirable 
and is inconsistent with the overall alms of CIA. H this feeling on our part is vaUd, then 
mechanisation of OCR is essential . Since the present mechanisation is completely in- 
adequate for any substantial increase in work load, serious study should be made of the 
requirements for more sophisticated mechanisation and the techniques available for their 

implementation. (5) B is felt that the work load placed on OCR and the problems m 

mechanisation of OCR cannot be considered Independently of the overall CIA operation, 
m particular. Increased usage of OCR depends upon its effective relationship with the 
analysis and production elements in CIA and with associated intelUgence agencies. 
Furthermore, common processing problems and overlapping information requirements 
must exist throughout CIA; it is essential that compatible data processing techniques 
also be made to exist. Consequently, the effective mechanisation of OCR must and 
should be considered in relationship to the mechanisation of the total CIA 
<6) This requires, as a minimum first step, a comprehensive study of the 
information processing problem, so that the relationship among various ““““S 
information systems, between them and the analysis and production elements of CIA. 
and between them and the other intelUgence agencies is clearly understood. 
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n. SUMMARY OF SURVEY 


A. LIMITATIONS ON THE SURVEY 

Our survey of the Office of Central Reference was a short three day examination. 

It is certain that there were areas which were missed and data which was misinterpreted 
by us. Furthermore, because of the requirements of security and the lack of contact 
with the users of OCR it is also certain that our understanding of OCR purpose and oper- 
ation is at best not precise. Under these conditions-brevity of examination, missed 
information, ill-defined purpose— it must again be emphasized that we cannot in any way 
carry out a "system analysis" of the present operation; hence, we are not at aU attempt- 
ing to suggest modifications in procedure, data flow, or other system considerations for 
the purpose of minor or even major improvement in present operations, hi fact, it is 
doubtful whether such an analysis is necessary since the present equipment seems to be 
used to its maximum efficiency anyway. 

B. THE FILES 


The Office of Central Reference is an information system based on one main file 
of documentary data and three associated registers: 1) The General File; 2) The Bio- 
graphic Register; 3) The Industrial Register; and 4) The Photographic Register. The 
contents of the first three of these are generally intelligence reports derived from CIA 
and other intelligence sources; a typical document, for example, may be a report by 
an attache associated with some diplomatic unit. The large mass of the inform 
stored is factual data with some degree of interpretation or evaluation. 

The files themselves are each large by almost any standards of comparison. 

For example, the General File presently contains approximately one million documents. 
The Biographic Register contains dossiers of various size covering 250, 000 individuals 
and involves an additional one and one-quarter million unfiled items. The Industrial 
Register covers 275, 000 industrial installations with nearly two million supporting doc- 
uments. The Photographic Register involves a large, but undetermined by the survey, 
number of individual photographs, prints and negatives, and rolls of motion picture 

film. 
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C- THE INPUT 
1. General File 

The input to the General File involves a standard processing pattern consisting 
of the following individual steps: First, the incoming documents, averaging some 
1, 500/ day, are screened for their individual relevancy and some 40% axe handled in 
a routine fashion with no further processing; Second, the remaining 60% are each read 
in some appreciable detail by a group of approximately 28 people for the purpose of 
specifying dissemination to interested organizations and analysts; Third, foUowing Hie 
dissemination, the documents are again read in detail by a group of 18 '‘coders", this 
time for the purposes of abstracting, classifying, indexing, and coding the individual 
documents for later retrieval. Each coder normally handles about 40 documents per 
day. This operation generally results in approximately six individual codes per docu- 

ment. 

Finally, the documents and the subject coding information are processed for 
insertion in the file. This processing itself involves a standard procedure as follows: 
the document is photographed on microfilm which is then prepared in the form of a 
punched card aperture card, identified by document number and some other relevant 
data. A bibliographic source description and a set of file cards is prepared, each 
containing the abstract in printed form and the relevant source, subject, or area code 
on which retrieval may be naade. 

The aperture cards containing the microfilm document image are stored in 
document number sequence and, therefore, correspond to an accessions sequenced 
storage system. The bibUographic source descriptions are stored in sequence by 
source identification. The individual file cards are sorted into subject code sequence 
or area code sequence and merged into their respective existing files. 

The aperture card file and the bibliographic source description file each con- 
tain approximately one card per document and thus each involve one million cards. 
These files grow at the rate of approximately 1, 000 new documents each day. The 
subject and area index files, on the other hand, together contain approximately 6 mil- 
lion cards (an average of 6 codes per document) and are growing at the rate of over 
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5, 000 cards per day. 

2. Biographic Register 

The Biographic Register is somewhat closer to an mtegrated IMormatlon retrleval- 
aaalysls operation than is the General File, since It involves the maintenance of a ^t of 

dossiers, the associated analytical work heing integrated with the file operation. Thtm 

is processed by the Biographic Register in a seqaence of steps: The first 
Involves the routing of new unfiled Information to a "geographic-area desk" where it is 
stored in strictly date sequence. The set of unfUed Informatton is analysed by tome 
experts for Its relevancy to the existing 250. 000 dossiers. Quick reference to the m- 
filed information is made available by a punched card tabulated Usting and a set of 5x 
file cards. When, as a result of the analysis process, previously unfiled information 
IS found to be relevant to existing dossiers. It is added to them. This resntts m t e 
addition of approximately 280. 000 pieces of new information to old cases. Fi^rmore, 
the munher of dossiers Itself increases by 25, 000 to 30. 000 new dossiers each year. 

Tb' dossiers are stored in sequence by assigned case numbers. Reference to 
them IS provided by three separate Indexes; the first are McBee key sort cards also 
stored in case number sequence; the second, an alphabettcal index by name m Soundex 
sequence; and the third, an alphabetical name index grouped by nationality. There is 
also a set of IBM punched cards used for preparation of tabular listmgs. T ese consis 
of "who-s who" cards arranged by case number within nationality and a set of category 
cards (averaging four per case number) grouped by professional categories. 


3. Industrial Register 

The Industrial Register is very similar in its purpose to the Biographic Register, 
except that it is concerned with industrial facilities. Its basic file of nearly ^o million 
documents is organized into a set of some 275. 000 "plant folders" arrange y geo 
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graphic area. This file of documents increases at the rate of 20, 000 to 25, 000 new docu- 
ments each month, each of which may refer to several installations. Summary informa- 
tion concerning the plants is provided by an industrial card file — a set of 5x8 cards each 
summarizing the significant information concerning a single plant. Access to the plant 
folders and the industrial card file is provided by a set of three indices, each of which 

^ separate items. The first list, called the "A” listing, is an 
alphabetical listing of geographic location (thus bringing together all plants 'wif lifj'n a givsn 
area). The second listing, called the "B” Listing, is an alphabetical listing of plant 
name and/or description. The third, called the "C Listing, is by plant activity. Up 
until July 1958 a set of punched cards, one for each plant, was maintained with a coding 
for single products of the plant. This operation has been discontinued. 


25X1 


Information is collated . into the file by an analytical process similar to that in- 
volved in the Biographic Register. 


4. Photographic Register 


Very little factual information was obtained concerning the Photographic Register. 
From the short description available, it involi/^es a manually maintained file of individual 
photographs and negatives. The p ositives are stored in geographic sequence with color- 
coded tabs providing a visual index 
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D. THE OUTPUT 


1. General File 

Information can be obtained from the General File either directly from the aper- 
ture card document-image section or by means of any of the index files — the biblio- 
graphic source description cards, the subject coded cards, or the area coded cards. 
Thus, requests for specific documents identified by number can be serviced directly 
from the aperture card file; requests in terms of source are referred to the biblio- 
graphic source file; and requests in terms of subject or subject and area combinations 
are answered through use of the subject-area index files. 
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No data was available on the number of requests serviced directly from the aper- 
ture card file or the bibliographic source file. However, the impression was gained that 
this represented an appreciable volume of activity, particularly when compared with the 
activity serviced by the two index files. 

Requests serviced by the subject and area index files average approximately 
4, 000 per year. The procedures in usage of these index files involve the following 
steps, some of which may not be present in handling particular requests: 

a) The requestor may discuss his requirements with a so-called "composite 
group" consisting of an expert familiar with the operation of the 
system, one familiar with the subject area coding system, and a trained 
librarian familiar with the problems of research reference. This compos- 
ite group, working together with the requestor, defines the search problem 
and establishes the criteria for mechanized decision. 

b) Cards are manually selected from sections of the index file considered to 

be relevant to the search. (This operation frequently results in the complete 
satisfaction of the request as a result of the manual search and the judgment 
of the searcher. Sometimes the unavailability of the 100 sorter for the next 
step forces such a manual decision to be made. ) 

c) An IBM-101 Statistical Sorter is wired for comparison of the punched in- 
formation in the manually selected cards with relatively complex combina- 
tions of additional subjects and areas. The result of this is a set of file 
cards satisfying the established criteria of the search. 

d) A bibliographic strip is prepared from this set of cards by a photographic 
reproduction of the abstract printed on each card. This process has in the 
past been called the Intellofax Output. 

e) This bibliographic strip is provided to the requestor for his further screen- 
ing of the documents on the basis of the abstract data. Specific documents 
requested are then retrieved from the aperture card file and reproduced for 
the requestor’s usage. 


SECRET 


Sanitized Copy Approved for Release 2009/11/23 : C1A-RDP82T00271R000100120001-9 


Sanitized Copy Approved for Release 2009/11/23 : CIA-RDP82T00271R0001 00120001-9 

dCWKC I 


8 

f) Meanwhile, the selected index cards are returned to the file area for eventual 
remerging in the index file. Several successive generations of intermediate 
files build up (apparently at times as many as six). This is essential for ef- 
ficient hgndliri g of the file in the remer ging process but is unfortunate from 
the standpoint of subsequent searches which must be made against each of 
these intermediate files to assure recovery of relevant data. 


2. Biographic Register 

The Biographic Register services approximately 5, 000 requests for information 
each year. These apparently are primarily handled either by direct reference to the 
appropriate index and resulting case number or by discussion with the expert at the rel- 
evant area desk. There is also generated a moderate volume of summary reports and 
listings disseminated to using organizations. 


3. Industrial Register 

The Industrial Register handles approximately 12, 000 requests for information 
each year. These requests can refer to industrial establishments, their cadre of per- 
sonnel, parameters of production, etc. 

4. Photographic Register 

No information was available concerning the volume of usage of the Photographic 
Register- 

E. SUMMARY 

In summary, the total reference file of the Office of Central Reference involves 
some three to five million separate documents organized in three main files. Indexed 
access to individual documents is provided by close to ten million index cards and half 
a dozen periodically published printed indices, each consisting of between one and two 

million entries. 

This rather large file (certainly one of the large information files in the country) 
handles less tban 25, 000 requests a year, i. e. , less than 100 a day. 
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m. SUMMARY OF REACTION 


A. PRESENT STATUS 


Our first and immediate reaction to the operation of the Office of Central Refer- 
ence was surprise combined with some degx*ee of disriiay at what we feel is the shockliL^ly 
small usage of its files. This initial feeling has been re-enforced by our later thinking, 
particularly in the preparation of this report. 


This statement certainly requires some degree of explanation. It was based on 
a number of considerations. Primary among these is the vital importance of the file, 
in terms of overall CIA and national objectives; to a certain extent this explains the 
feelings of dismay. The surprise resulted from comparison in our own mind of OCR 
with other systems of similar size and complexity (but presumably less vital importance) 
whose usage is greater by factors as large as 100. Among these might be included the 
files of the Title Insurance and Trust Company of Los Angeles, the files of the Medical 
Impairment Bureau of the Insurance Industry, the files of the Department of Motor 
Vehicles of the State of California, any one of several credit bureau files, the files of 
Time, Inc. , the files of the Police Department of Los Angeles, and the files of other 
intelligence activities. (It must be admitted that the nature of some of these files, but 
not of all of them, may be somewhat different from those of OCR. As discussed below, 
however, this should represent a reason for careful examination of the purposes in usage 
of the file, to determine how more effective usage can be made. ) 


Finally, a rather cursory examination of the costs and expenses involved in input, 
output, filing, and (perhaps most important) collections indicated that the cost of a search 
in OCR is perhaps 10 to 100 times greater than searches in other systems of varying size 
and complexity. This extreme cost is a direct result of the low usage. 

In summary, the immediate explanation for our reaction that the usage of OCR is 
shockingly small is the fact that it represents an enormous file which is vitally impor- 
tant and yet is almost completely unexploited. 
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B. WHY IS THE USAGE SO StVUiLL? 


After the initial reaction of surprise the next reaction was one of attempting to 
pvp ia in this anomaly. There seem to he at least five relevant factors j each of them 
somewhat related to the others so that the explanation is not a completely clean-cut 
one. Considering each of these factors in turn: 


1 ) 


First is the extremely effective dissemination provided by OCH. 


This is ob- 


viously an important function because of the need to provide analysts with 
relevant information at the earliest possible moment. The result of this 
effective dissemination is that the individual analyst presumably sees aU 
information relevant to his problem. There is a resulting r eduction there- 
fore In the tendency to use the files. 


2) Resulting from the fact that the dissemination presumably provides the 
analyst with all current information relevant to his interests is the natural 
tendency by the analyst to depend upon his memory and his personal files. 
Thus, he not only reads the daily "take" in terms of its relevancy to an im- 
mediate situation or research problem, but also remembers it and, in fa.ct, 
probably files the document itself in some readily accessible personal file. 
Now, certainly, each of these effects is desirable. It is only by the picture 
■ which the analyst builds up in his own mind that he is able to interpret the 
significance of new data; it is this comprehensive picture stored in the 
analyst's memory which makes him the expert and capable of quickly pro- 
viding aii evaluated answer to requests for Information. Furthermore, there 
is little question concerning the value to the analyst of having immediately 
available on his desk highly significant and current documentary information 
to re-enforce his memory and support his conclusions. 

However, desirable though these two factors of the analyst's reliance 
upon his memory and his personal files may be, it is evident that these are 
usually carried to extremes. Attempting to remember every relevant fact, 
most of them usually remote from immediate Interests, imposes a memory 
burden beyond the analyst's capacity. In establishing personal files the 
analyst is further burdening his memory with the necessity of remembering 
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not only the information from the documents he has read but also where he 
put those documents in case he requires them for supporting purposes. This 
additional burden on the memory, combined with, the human tendency to for- 
get, mgaTig that not only the information but also the retrieval of documents 
become more and more uncertain. 


3) The third factor causing the low usage of the OCR files is the virtual restrict- 
ion of present opera-lioii to answeriiig research, type questions* 




evidenced by the logical complexity of the questions and by the necessity for 
the composite group to act as an intermediary between the researcher and 
the file. Such usage by its very nature is not likely to be large, since re- 
search is time consuming and the file is used primarily to provide the initial 
starting point. In part, this restriction to a very limited type of usage is a 
historical situation and probably the result of OCR position in the CIA organ- 
ization. In effect, it has served as the supporting library for the research 
function alone. 


4) The first three causes discussed are to a large extent the result of the very 
nature of the work carried out by analysts. The fourth cause, however, is 
quite different. The OCR file has, unfortunately, developed a reputation 
among users for providing a slow and cumbersome response. There is thus 
a resulting lack of confidence and a feeling that the information wiU arrive 
too late to be useful anyway. The obvious effect of such a lack of confidence 
is the avoidance of using the file. 

5) This unfortunate situation has been enormously complicated by the historical 
compartmentalization of intelligence work. The effects of security are ob- 
vious by its nature. However, in addition, the effects of both intra-agency 
anH inter-agency politics have probably also been operative. To a certain 
extent this politics may be based on a valid consideration: namely, the nec- 
essity of adequately supporting the specialized needs of the individual com- 
ponent. Unfortunately, there is a resulting tendency to carry this to extremes 
and to perpetuate the compartmentalization in the usage of the files. Thus, 
OCR has not been effectively used as a central Information service for the 
intelligence community. 
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la view of this over-riding fact of the small usage of OCR tiles th^ t, ■ 
tion of OCR one ration u- ® ' '-''-'H tiles, the mechaniza- 

that h . questionable. Indeed, it was our impression 

mvolved £„, adequate utaohine logieal puocesatug. 

to conolualoM. assumtog that the preaeui conditions of usage MU continue 
ssentiaUy agree with those ImpUed by previous investigation.. In other words if ’ 

::rrh“ r ~ 

should TT' ““ status of usage 

“ “• -ge - 

section of to mechanisation are then valid. The remaining 

ZTL T " wiy sutoa 

moreased usage and what should be the corresponding effects upon mechanlza- 


SECRET 


Sanitized Copy Approved for Release 2009/11/23 : CIA-RDP82T00271R0001 00120001 -9 


Sanitized Copy Approved for Release 2009/11/23 : CIA-RDP82T00271R0001 00120001-9 


SECRET 


13 


IV. THE RQT.K OF THE OCR FILE. 

TTff mechanization 

the GENERAL ROLE OF AK INFORMATION SliSTEM 

formation systems can be used for any one. or several, of a number of 

„rposes. SpeoIRoally. teey can be used for sRnple accumulation or ^cbryM^e. 

bey can bo used for interrogation concerning specific foots; fliey can be used as re 
micb teol wlHi "iudgnient.. scanning as a critical element; finally «iey cm be used as 

a basis for file-generated analysis and reporting of information. Eachoftbeso 

purposes Is to a certain eatent present In the operation of any ^ 

though generaUy one or two are of a predominant importance. In new of this , ^ 
IXL are iLedlately relevant to the purposes of this report; tee first relates to 
the role of the OC® flies as the Informaaon system tor national centra intelUgenc^an 
therefore how It should be used; the second relates to the role of mec aniza ion wi 
respect to implementiBg the OCR functions. 


j. ROLE IN CENTRAL INTELLIGENCE 


Rt fbe past there has been an almost tradltloma division of the 
« -tv and its supporting functions. This division is a result of several factors, 
l^de the need by line activities for direct IntelUgence support closely attuned to the 
ndlvldual line retpHrements. flie highly slgulflcant rcTilrenuents of ^ 

flv the psvchological makeup Of intelligence workers themselves. In part, 

rest« rLi Of nsnge of OCR flies Is a areet reanlt of toe historical .ot of 
Lts position as the supporting library for a single division of a, single agency. 

Despite this historical state of affairs, however, there has become evident the 

* T H -Titellieence activity — not as a replacement for the traditional me 
need for centralized intelligence acuviiy o„nnort of 

responsive intelligence groups but as an additional InteUIgence oapabil^ m 
overall national strategic needs. Such a centralised intelligence activity 

be derived from toe treduently oonDlotlng reports of toe line groups. 
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Once the necessity tor a centralised national Intelligence responsibility has been 
recognised, the concomndttant need tor a centralised supporting Information system 
so becomes immediately evident. It is only by the consolidation ol aU relevant Intor- 
-hon at a single point that Its reconciliation and analysis In terms of overall r^l 
menm canhe accomplished. Once such a centralised tile enlsts. the economics o' 
operation require that It act as a supporting Information system for the entire IntelU- 
gence rammumts-. In this »ay, although the Individual line groups may require 
specialist mformatlcn systems to meet their specialised requirements, they »ill 
dravr on the centralised information system for their basic datai In turn, they will pro- 
vide the input to the centralised system of the results of their own analysis. 
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C. THE NEED FOH RESPONSIVENESS 


The great difficulty with a centraUsed Information system, however. Is its respon- 

Ter » th' ? - •“ -O-l-oonts of the 

r. If the traditioiml reaction against centralisation Is to be overcome, the central- 
ised informaaon system must provide a quality of service at least equivalent to that 
previously obtained. 

„ , '^spo-vlveness. as Indicated, has two aspects which are closely related, m 
e o owing, three levels of response are discussed: the first (called "research 
response") Involves oomplen, sometimes badly formulated, requirements but with a 
relabvely relaxed time retpilrement; the second (called "Indioatlons response") requires 

r tiTT, Id terms o£ an Intnitlvely understood total situation: and 

e r ca ed "intormation response") involves the answering of speolfio, well-defined, 
airly simple retjuests on a reasonably rapid time scale. 

m the past the major emphasis of the OCR library has been in terms of response 
of the research fype. (As indicated above, this is probably a daxect result of its organ- 
izational position as the supporting library for a single division of CIA). By its very 
nature, it is unlikely that such activity will ever be of large volnme. Similarly, impor- 
tant though the indications response is in the overall intelligence picture, it again is 
unlikely that an information system of any presently available type will be involved to a 
great extent, because of the critical role played in indications response by human judg- 
ment. On the other hand, it is our feeling that information response requirements will 
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involve enormoue volume, 1£ the oepabimy I. present ,o pro.lde the serve, -cessa^ 
formation response requirements have not previously been a siftnihcant toetor in 
operations; yet they represent a critically Important intelllsence capability and usually 
constitute the major load in other large size information systems. 

To illustrate the importance of information response, consider even the indi- ^ 
vidual research analyst. He Is concerned with three types of situations each correspon.- 
ing to one of the response patterns described above. Thus be Is a research specialist 
in some area. As such it is his responsibility to develop a total picture of that area, to 
define and research unresolved questions, and to convey background information to 
others in a form adequate tor their usage. On the other hand, being the 
area, he is In the posfflon to evaluate the significance of specific events and, ,f thy ar 
critical, to quickly deOne the strategic situation. He .has has the responsibilliy of pro- 
viding Immediate indications response to critical sltaallcns at the earliest posmh e 
moment - presumably on the basis of Mormatlon disseminated to him. Finally , ow 
ever, the Individual analyst, being an ezpert In an area. Is also In the best portion to 
provide intelligent evaluation of the signillcance of mformauen which, olthongh ny 
necessarily crltleal from his standpoint. Is certainly Involved with the inter-relation 

ship between Ms area and others*. 

It is this third situation wherein the need for Irformatlon response Is significant 
TO illustrate its Importanee, consider the foUowlng hypothetical situation. A spea ic 
analyst "A" is responsible for Iraq. During the past several weeks he has yen 
principally concerned with two problems. The first is some basic research ^ the 
^ture Of L communist activities in Iraq (Ihe struemre o, the local parly, its from 
activities. Its agents in governmental positions, etc.). He has also been 
with the possibilities of nationalist revolution against the existing rule as a particular 
orltioal problem area. Documents disseminated to him have been vey na ur y 
ecanned primarily for their relevancy to these two problems, although -re H wamh 
has been kept for other critical situations. Suddenly, the Israeli Invasion of yypt 
arises as a critical situation. The Iraqi expert should be able to provide qu.y and 
reliable information response on the significance of this invasion on relabonslups 
between Iraq and Egypt. Similarly, the experts on other areas, such as yrm, 

Turkey, etc. . should be able to provide the same quick and reliable Inlormation 
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This example should illustrate a number of significant points: 

1) The response required is certainly not of the same na.ture as a research 
activity. The aim is primarily to provide information, although certain- 
ly screened by the evaluation of the expert analyst involved. 


2 ) 

3) 


4) 


On otIi6r iisjiciy it is 
recognition of a critical 


certaiuly not an indications response wherein the 
situation is the significant element. 


It is unlikely that most analysts would be in. a position to provide the In- 
formation required out of their own memories or even out of their own 
files. The information required in most cases will not have been upper- 
most in their minds and thus will have been at best vaguely remembered. 

Furthermore, if the realities of personnel turnover are considered, all 

too frequently ^he.analyst, if newly arrived, will not even have seen the 
relevant information. 


5) If the capabiUty is present for providing to each analy st the Information 
reqmred for him to evaluate and answer such precisely defined situations, 
this capabiliiy certainly will be utilized. As confidence is gained, the 
number of situations to which this type of response will be made will be 

increased and the overall effectiveness of our strategic knowledge MU be 
improved. 


Some measure of the effect upon present OCR operation of the requirements for 
information response in such a critical situation can be gained by considering the 
volume of activity folloMng the outbreak of the Israeli-Egypt crisis. At that time the 
number of daily requests serviced by the OCR General File increased from twelve to 
fifty, and the service requested was pro^a&d only by going into multiple shift operation 
It seems clear, that the fourfold increase in number of requests was a. direct result of 
an information response requirement. It further is apparent that such a usage of the 
library was essentially extraordinary rather than usual. 

The importance pf information response has been illustrated in terms of the 
indiwdual research analyst. Yet clearly the need for information must exist through- 
out e operation of CIA and the intelligence community in general. The other 
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divisions of CIA, the field operations, the other intelligence agencies - each must re- 
require a maxinaum of specific information for effectiveness in its operation. But, 
from personal experience, it is evident to us that they frequently do not have easy 
access to information which must have been stored in OCR files. If rapid information 
response were available from OCR and were utilized by those requiring it, our total 
intelligence position would immeasurably improve. 

A number of conclusions can be made: 

1) OCR files are not being used for information response situations except 
under extraordinary circumstances. 

2) The potential of information response as a tool in CIA analytical work 
is un-utilized. 

3) The operation of OCR, its mechanization, and its relationship to the 
analyst should be modified to provide information response service. 

This capability, not only of the OCR file, but also of the analyst, should 
be exploited to the utmost. 

4) H this situation is developed, the usage of OCR in terms of number of 
daily requests from CIA analysts very probably will increase from 
the present 100 per day to about one thousand per day. 

5) As confidence increases in the capability of OCR to provide rapid in- 
formation response, its usage not only by the research analysts 
within CIA but by the entire intelligence commimity will correspond- 
ingly increase. The resulting usage may add another one thousand to 
two thousand requests per day. 

6) In this way , the OCR files can become a core library system for the 
entire intelligence community with only specialized peripheral libraries 
being necessary outside OCR. 
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D. THE ROLE OF MECHANIZATION 

An increased activity of the type described <na:n dys from the present 100 per 
day to 2,000 to 3,000 per day) can certainly be handled by manual systems. In fact, 
many of the systems previously mentioned are entirely manual and are handling 
volumes at least this great. However, the economics of mecbaniTiatifon becomes 
extremely favorable as larger volumes are involved. This fact has led each of the 
large information systems presently involving manual operation to consider techniques 
for mechanization. In doing so, the existence of an adequate manual system hai^ 
usually represented a severe stumbling block because of the difficulties in conversion. 

OCR, on the other hand, is in the position of not having a system adequate for 
this type of service. It, therefore, can very easily consider mechanization since, in 
any event, a conversion of operation will be necessary if the recommended service Is 
to be provided. Furthermore, the conversion itself is very much facilitated by the 
present semi-mechanized’ state of file index records. 

The questions arise, however, as to what role mechanization can fi)l in the 
operation of OCR. First, as has been repeatedly emphasized, large volume is an 
essential element for a mechanized system. It presumably has the capacity to handle 
the volume at a lower total operating cost than that of an equivalent manual system. 

On the other hand, it needs the volume to justify the expense (mechanized installations 
generally have a high unit cost and a high installation and conversion cost in compari- 
son with manual systems) , 

Second, however, mechanized systems must operate in a well-defined environ- 
ment. Economic operation depends upon organization for efficiency and TnayiTnurp usage 
In addition, machinery by its very nature must be programmed with all operations finely 
detailed. In particular , all judgment and decision functions must be reduced to program 
med operations. This means that undefined judgment and, in general, human contact 
with the systeija can become remote and difficult. 

Third, the availability of mechanized information provides the possibility for ex- 
ploitation of this data through file-generated activity. This means that many requests 
can be pre— answered and that the file can be continually re— organized for efficiency in 
response. Furthermore, such file-generated activity is usually an essential factor in 
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economic operation. If effectivplv noori u < 

replaces „ me acvp,; ^ :Tzr' 

provide service prior to need. ^ 

It is clear that the present: conditions of operation completely fail to meer .h 

fl.ree criteria lor effective eieCaalaaffon. I„ laC,, OCR is co„ce:„ed Ua s n"' 
volume of renuAcfa ...... , > ned Avirh a small 

no and .Here Is 

On the other hand, if the recommended service of infer mat 
Pleniented. ..ere .11 .e a ^anp-leid increase t ZZi::! 'I T 

o<™“ 

~r 

response is to be provided to the lara-e volume nf infn i ^ 

need for the file tn h of inf ormaition requests. This Implies a 

P . K also implies an awareness by the file operation ot its own states and activity 
antoegh routine statistical and simple analytical wort suitable to meohaulsad operaUou. 

To implement the preoedlpg, we oan detlne . set of gross speolfloatlons on the 
requirements for mechanization of OCR.- 

1) Capacity for large volume documentary and information storage com- 

ined with capacity for handling large volume usage is essential. 

2) Provision must be made for capability of close human contact. /Few 

mechamzed systems provide this capability, yet the need for main- 
taining at least the present level of service to research type requests 
IS an essential element ol continued OCR operation. Close human 
contact is required to develop confidence In the 111,, understand Its 
contents, and easily apply human judgment where It may be essential.) 

3) TO implement the need lor Information response, the mechanisation must 
provide rapid, inexpensive access to well-defined information. 

4) Processing capabillly must provide awareness of the changing environ. 

ment of input and requests and must support the research function with 

mechamzed access to patt;eniSo 
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5) Rapid file handling capability for reporting, file organization and reorgani- 
zation, and other file-generated activity must be inexpensive and easy. 
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V- relations hip between OCR mechanization 

AND THE OVERALL MECHANIZATION OP PTA 


EssentiaUy CIA Itself represents an enormous information processing system 
collection, analysis, storage and retrieval, translation, processing, statistical 
analysis - all being involved. We do not know the status of mechanization of overaU 
CIA operation. It is clear, however, that such a complex information processing 
system must eventuaHy be aided in large part by mechanization. Consideratinn of 
economics, speed of response, and necessary processing capability all combine to 
force it upon CIA. It is essential however, that mechanization not be developed in a 
piece-meal fashion. The very close tie-in between the analytical process, the input 
and translation process, and the information storage and retrieval process is merely 
one evidence of this need. There is thus a necessity for an Integrated picture and a 
definition of common elements throughout the operation. 

Jn this report we have been able to discuss only those considerations relevant 
to OCR operation. The conclusions must therefore be limited in their awareness of 
overall alms and needs. We recommend that a completely integrated study of total 
CIA requirements be implemented. 
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